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DNA SEQUENCES RELATED TO ISOLATED f ragtt.E * svwnp nyrc 

Technical FHaIH 

The invention relates generally to genetic 
5 diagnosis of humans. In particular, the invention 

concerns identification of individuals having particular 
DNA sequences predictive for Fragile X Syndrome. 

Background Art 

10 Fragile X syndrome is the most common form of 

familial mental retardation and affects about one in 
2,500 children. The syndrome is characterized by the 
presence of a cytogenetically detectable fragile site in 
band q27.3 near the end of the long arm of the X 

15 chromosome which, if not the cause of the disorder, is 
closely associated with it. The diagnostic molecular 
genetics of the Fragile X Syndrome has been reviewed by 
Sutherland, G.R. et al. ( Clinical c^nor, (1990) 37:2-11). 
An additional review is found by Nussbaum, r.l. et al. 

20 (Ann. Rev. Genet. (1986) 20:109-145). 

Identification of the DNA spanning and 
including the fragile site has been reported by Kremer et 
al - (Am. J. Human Gen^ire (1991) 49:656-661) and Heitz, 
et al. (Science (1991) 251: 1236). Characterization of 

25 the fragile site has indicated a particular region of 
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instability within a 5*0 KB EcoRI restriction fragment, 
with the instability segregating with the Fragile X 
genotype (Yu et al., Science (1991) 252:1179) . The 
region of instability has further been localized to a 1 
5 KB Pst I fragment containing a p(CCG) n repeat* The 
Fragile X genotype is characterized by an increased 
amount of unstable DNA that maps to the repeat (Kremer et 
al., Science (1991) 252:1711). The availability of the 
cloned DNA makes possible the use of the DNA as a probe 
10 to detect length polymorphism of the p(CCG) n to 

characterize the genotype of an individual at that locus 
(Kremer et al., supra ) , thereby obviating problems with 
cytogenetic visualization at the fragile site (Webb et 
al., Prenatal Diagnosis (1989) 9:771-781). 
15 Additional diagnostic tools are available in 

the form of polymorphic microsatellite markers linked to 
the fragile site at Xq27.3 (FRAXA) . Richards, et al., 
(Am. J. Hum. Genet. (1991) 48:1051-1057) have described 
polymorphisms associated with length variation in 
20 dinucleotide microsatellite repeats in the vicinity of 

Xq27.3. These markers have a recombination frequency of 
1% and 7%, respectively, in two-point linkage analysis in 
31 Fragile X families. 

Thus, the availability of cloned DNA spanning 
25 the fragile site provides reagents uniquely suited for 
the detection of the Fragile X allele in appropriate 
subjects. Furthermore, techniques of gene therapy could 
be used to replace or compensate for the pathologic 
Fragile X sequence in affected cell types. 

30 

Disclosure of the Invapj^pn 

The invention provides a human DNA sequence 
corresponding to the Fragile X locus and provides a 
source for suitable probes for diagnosis and sequences 
3 5 useful for modification in therapy. The obtention of 



WO 92/12262 



PCT/LS92/00020 



-3- 

this sequence from the fragile site thus permits an 
improvement in diagnostic techniques as well as the 
possibility for genetic manipulation to overcome the 
disorder. 

5 In one aspect, the invention is directed to an 

isolated and purified DNA molecule of no more than 275 kb 
which includes the fragile site. In another aspect, the 
invention is directed to a subsequence contained in this 
larger DNA of no more than 150 kb, which includes the 

10 fragile site. In still another aspect, the invention is 
directed to a DNA probe which crosses the fragile site, 
and to the corresponding normal sequence useful in 
replacement therapy. 

In still other aspects, the invention is 

15 directed to methods to determine the presence or absence 
of the Fragile X allele in a subject which method 
comprises probing DNA isolated from the subject with the 
probe of the invention. Affected individuals appear to 
have an amplification of a (CCG) n repeat sequence at the 

20 fragile site which gives a band of different size than a 
normal individual when Southern blots are probed with the 
probe of the invention. 

In another aspect, the invention is directed to 
oligonucleotides useful as primers in the polymerase 

25 chain reaction amplification of polymorphic 

microsatellite AC repeats closely linked to the Fragile X 
locus. Thus, these primers may be used to identify 
alleles of the microsatellite regions which vary in AC 
repeat length, thereby providing a method for screening 

30 for a microsatellite repeat sequence allele predictive of 
inheritance of the Fragile X allele. 

In still another aspect, the invention is 
directed to methods to correct the fragile site by 
substituting the normal DNA contained in this region or 

3 5 otherwise compensating for this defect, such as by 
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administration of the normal protein product or by 
antibodies directed against the protein product. 

Brief Description of the Drawings 
5 Figure 1 shows a schematic of XTY26, a 280 kb 

plasmid derived from a yeast artificial chromosome (YAC) 
including a 275 kb human DNA sequence spanning the 
fragile site. 

Figure 2 is a diagram depicting the steps taken 
10 in localizing the DNA sequences which comprise the 
fragile site and the variable region. 

Figure 3 depicts the Southern blot analysis of 
SSSRI-digested somatic hybrid cell DNAs with subclone 
X-5, which comprises the 5 kb £coRI fragment from XTY26. 
15 Figure 4 depicts the Southern blot analysis of 

EcoRl -digested DMA from two normal and four unrelated 
Fragile X Syndrome affected males with subclone X-5. 

Figure 5 depicts the Southern blot analysis of 
Pstl-digested DNA from members of the illustrated Fragile 
20 X Syndrome pedigree. 

Figure 6 illustrates the DNA sequence of the 
1.0 kb PstI fragment from a Fragile X Syndrome library. 

Figure 7 illustrates the location of primer 
sequences and polymorphic microsatellite regions for 
25 FRAXAcl, FRAXAC2, and FRAXAC3. 

Figure 8 illustrates the location of subclones 
of the Pst I region of the fragile site. 

Modes of Carrying Out the Invention 

30 

Definitions 

As used herein, "fragile site" refers to a DNA 
sequence which occurs at the Xq27.3 locus on the X 
chromosome in individuals subject to familial mental 
35 retardation associated with Fragile X syndrome. "Fragile 
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X locus" refers to this location whether in normal 
individuals or in persons affected by the condition. 

As used herein, "expression" of fragile X DNA 
refers to cytogenetic or microscopic manifestation of the 
5 fragile site. 

All DNA sequences disclosed herein are intended 
to include complementary sequences, unless otherwise 
indicated. All DNA sequences are written in a 5'-to-3' 
direction and conform to nucleotide symbols recommended 
10 by the IUPAC-IUB Biochemical Nomenclature Commission. 

Construction of xtysk 

A DNA library was constructed from a human 
subject known to contain the Fragile X locus by the 

15 procedure of Reithman H.C. et al. f Proc. Natl. Acad. Sri. 
USA (1989) 86:6240). The procedure is designed to rescue 
telomeres by complementation and was modified by 
digesting the vector pTYACl, which propagates in yeast as 
an "artificial yeast chromosome" (YAC) with BamHI and 

20 EcoRI or Clal to accommodate inserts digested with either 
EcoRI or Taql and -obtained from the human genomic DNA 
described below. This method of construction of the YAC 
library selects for clones which acquire or no longer 
need an additional telomere. A few clones contain true 

25 telomere sequences, others contain segments from 

nontelomeric regions. Circular chromosomes which are 
maintained as such in yeast also satisfy the selection 
(Hieter, P., et al., Cell (1985) 40:381). 

The immediate source of the genomic DNA that 

30 was inserted in the vector was the human/hamster somatic 
cell hybrid X3000.il, described by Nussbaum, R.L. et al. 
(Ann. Rev. Genet. (1986) 20:109-145) which is known to 
contain a region of human X chromosome from band q24 to 
qter which spans Xq27.3 and which is known to have the 

35 abnormal Fragile X from the original human subject. This 
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portion of the X chromosome is translocated onto a 
hamster chromosome in the somatic cell hybrid. The DNA 
from X3000.ll was digested with TaqI, ligated into pTYACI 
and transformed into yeast on selective media. The 
5 resulting library was screened with the pVKl6BI probe 
known to map close to the fragile site as described by 
Abidi, F.E., et al. (Genomics (1990) 7:363) , and only one 
clone, XTY26, was positive. 

Analysis of the XTY26 clone led to the 

10 conclusion that it is a circular YAC with the map shown 

in Figure 1. l£ situ hybridization was used to determine 
that the XTY26 clone spanned the fragile site. Total DNA 
was extracted from yeast cultures containing XTY26 and 
labeled with fluorescence using the technique of Kievits, 

15 T. et al. (Cvtogenet. Cell Genet. (1990) 53:134), and the 
labeled DNA was used as a probe for in situ hybridization 
to metaphase chromosomes expressing Fragile X. The 
location of the fluorescence labeling relative to the 
cytogenetically observable fragile site was observed as 

20 shown in Table 1. The location of the fluorescence on 
the chromosome was -scored as "proximal," "central" or 
"distal." 



25 Table 1 

Location of Signal for Various Probes in 
Relation to the Fragile Site at Xg27,3 

Position of signal in relation to fragile site 

30 Probe Proximal Central Distal Proximal and Distal 

XTY26 11 
VK16 10 
2-34 9 
Do33 0 



10 
2 
3 
0 



39 
0 
0 

10 



8 
0 
0 
0 



35 
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15 



Sequential metaphase spreads from two Fragile X males 
were examined until at least ten X chromosomes expressing 
the fragile site and exhibiting signal from probe 
hybridization had been scored. The position of the 
signal was scored as proximal, central (i.e., overlying 
the gap in the chromosome) or distal to the fragile site. 
Sensitivity and specificity was such that 35-90% of all 
metaphases (depending upon the probe) had yellow 
fluorescent dots on the end of at least one chromatid of 
the X chromosome with virtually no background signal. 

In this context, proximal with respect to the 
fragile X site means closer to the centromere; distal 
with respect to fragile X refers to a location closer to 
the telomere. The majority of the signal was found 
distal to the fragile site, even though the probe VK16 
used to isolate XTY26 was proximal with respect to the 
Fragile X locus in in sjtu hybridization. The finding of 
label over proximal, central and distal sites as shown in 
Table 1 indicates that the clone XTY26 contains DNA 
complementary to areas of DMA throughout the fragile 
region . 

Additional flanking DNA markers known to map 
close to the fragile site, Do33 (DXS465) and 2-34 
(DXS463), described by Rousseau, F. et al. ( Am. J. Hum. 
25 genet. (1991) 48:108-116) were also found to be present 
in XTY26 and their maps for the restriction enzymes 
BamHI, Hindlll and TaqI were identical in both XTY26 and 
human chromosomal DNA. Because the marker Do33 binds to 
DNA distal with respect to the fragile site, and marker 
2-34 binds to DNA proximal with respect to the fragile 
site in in situ hybridization, their presence in the 
XTY26 clone supports the conclusion that the DNA insert 
in the clone spans the fragile region. 

The circularity of XTY26 was verified using 
restriction analysis, and rests on at least four 



20 



30 



35 
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observations . 1) Sail cuts XTY26 only once and maps 
within DXS293 which, according to other digests with 
Nael, maps toward the middle of the human DNA sequence. 
The Sail digest gives only a minimal alteration in the 
5 size of XTY26 as compared to undigested DNA, consistent 
with the slight difference between circular and linear 
DNA of the same mass. 2) DXS293 mapped into the same 
Nrul fragment as 2-34 (140 kb) but to a 120 kb Sfil 
fragment that was different from the 160 kb Sfil fragment 
10 bearing 2-34. The two Sfil fragments (DXS293, 120 kb and 
2-34, 160 kb) equalled the total length of XTY26. 3) In 
addition, 2-34 mapped to within 60 kb of one end of the 
human DNA insert on an Nael digest and also to a 50 kb 
Clal fragment, yet vector sequences which map to the same 
15 60 kb Nael fragment are found on a 80 kb Clal fragment. 
The Clal sites at map positions 5 kb, 55 kb and 205 kb 
indicate the origin of these fragments. 4) A subclone of 
XTY26 has been generated which contains both Do33 and 
vector sequences . This places the vector sequences 
20 between Do33 and 2-34 completing a circle with the human 
DNA insert (Figure l) . 

Most of the restriction endonucleases used to 
generate the pulsed-field gel map of XTY26 contain CpG 
dinucleotides in their recognition sequences. While this 
25 contributes to their underrepresentation in the genome, 
and therefore utility in long range restriction mapping, 
the methylation of mammalian DNA at these sites rendered 
a direct comparison of the XTY26 map to human chromosomal 
DNA all but useless. A fortunate exception was Sfil 
30 whose recognition sequence does not contain CpG and which 
generates a 120 kb Sfil fragment from XTY26 containing 
DSX293 and most of the DNA between this locus and Do33 
(approximately 150 kb) . The same 120 kb Sfil fragment 
was detected in human lymphocyte DNA from a normal 
3 5 individual confirming the integrity of at least a portion 
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of the human DNA sequence in XTY26. The integrity of the 
human insert is further supported by evidence from 
restriction maps of YACs in this area that show the 
probes 2-34 and Do33 markers to be approximately 210 kb 
5 apart. Consistent with these data is the finding that 
these markers are between 230 and 260 kb apart in XTY26. 

XTY26 was deposited at the American Type 
Culture Collection, Rockville, MD, on December 5, 1990, 
under the terms of the Budapest Treaty, and has accession 
10 no. ATCC 74041. 

location of a DNA Probe Spanning Fragile X 

To identify sequences which constitute the 
fragile site and to screen for DNA differences between 

15 normal and Fragile X individuals in the vicinity of the 
fragile site, sequences from XTY26 were used as 
hybridization probes. Localization of the fragile site 
was accomplished by first establishing a contig of 
X-phage subclones between the two closest sequences which 

20 flanked the fragile site. A diagram of the relevant 
portion of XTY26 is shown in Figure 2. 

The VK16 site (which had been utilized to 
isolate XTY26) has been localized proximal to the fragile 
site by in sl£u hybridization (Kremer, E. et al. , Am. J. 

25 Human Qenet . (1991) 49:656-661), incorporated herein by 
reference) . its position in XTY26 is shown in Figure 2 . 
The distal end of the contig was established by initially 
screening the lambda library of XTY26 with an Alu PCR 
product (Nelson, D.L. et al. f Proc. Natl. Acad, sr.i tt^zv 

30 (1989) 86:6686), referred to as Alu2 (Figure 2). The 
subclone #91 was isolated with this probe and was 
subsequently shown by in, situ hybridization to map distal 
to the fragile site. Riboprobes from each end of #91 
were used to "walk away" from this locus and the 

15 direction of the "walk" was established by hybridization 
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back to blots of various restriction enzyme digests of 
XTY26. Each of the lambda subclones between #91 and VK16 
was mapped relative to the fragile site by in situ 
hybridization. 

5 The detailed steps of the above procedure are 

depicted in Figure 2, and are as follows. The letters at 
the beginning of each paragraph refer to the figure. 

A: The "rare-cutter" restriction endonuclease 
map of the yeast artificial chromosome, XTY26, was 

10 determined by pulse-field gel electrophoresis (Kremer, E. 
et ah, Am. J. Human Genet . (1991) 49:656-661). The 
locations of four probes (VK16, 2-34, Do33 and Alu 2) are 
indicated. Alu 2 was generated in a PGR using XTY26 DNA 
as template and the Alu consensus sequence oligo TC-65 

15 (Nelson, D.L. et al., Proc. Natl. Acad. Sci. USA (1989) 

86:6686) as primer. The localization of other probes has 
been reported previously (Kremer, E. et al. , supra ) . 

B and C: A contig of subcloned DNA fragments 
of XTY26 was generated by construction of a partial 

20 Sau3AI digest library in XGEM-3 (Promega) , using the 
manufacturer's protocols and packaging extracts. The 
library was first screened with total human DNA, then the 
plaque-purified array of 108 clones was probed with Alu2 
and VK16. Riboprobes were generated (again using the 

25 manufacturer's protocols and reagents) from the positive 
clones and used to "walk" towards and across the fragile 
site region. The direction of the "walk" was established 
by mapping these riboprobes back to the XTY26 restriction 
map. Each of the subclones was then used in fluorescence 

30 in situ hybridization to localize the fragile site with 
respect to the contig. This localization and its 
approximate boundaries are shown by dashed lines. 

D and E: Each of the clones which flank and 
span the fragile site region, as defined by in situ 

35 hybridization, were used as probes on Southern blots of 
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s matic cell hybrid DNAs. These results confirmed the 
JESfiRI restriction map across this region. The location 
of the breakpoints in hybrids Qix and micro 2 ID are 
indicated by dashed lines. 

F: This shows the restriction endonuclease map 
of the 5 kb EsqRI fragment which demonstrates instability 
in Fragile x individuals. The CpG region is indicated by 
the cluster of "rare-cutter" restriction endonuclease 
recognition sites. 

G: Restriction fragments were used as 
hybridization probes to delineate the region of 
instability. 

The in ?itu hybridization mapping delineated 
the sequences which appeared to "bridge" the fragile site 
to about 15 kb, although the extent and boundaries of 
this region could not be sharply defined. Each of the 
lambda clones which bridged the fragile site was then 
used as a hybridization probe against several somatic 
cell hybrid DNAs. Two of these, Qix and micro 2 ID, had 
been constructed from a Fragile X parent cell line (Y75- 
1B-MI) in a way designed to break the X chromosome at the 
fragile site (Warren, S. et al., Proc. Natl. Acad, sai . 
USA (1990) 87:3856). These hybrids have breakpoints 
which mapped within the same 5 kb EcoR l restriction 
fragment (Figures 2 and 3). 

With respect to Figure 3, chromosomal DNA was 
isolated from the somatic hybrid cell line CY3, which 
contains the Xq26-qter region intact from a normal X 
chromosome (lane 1); Y75-1B-M1 (lane 2): Qix (lane 3): 
Micro 21D (lane 4) and the mouse cell line A9 , which is 
one parent line of CY3 (lane 5) . The chromosomal DNA was 
subjected to cleavage with restriction endonuclease 
EcoRI, subjected to gel electrophoresis, and probed with 
nick-translated \5. The Southern Blot obtained is shown 
in Figure 3. The kb EcoRI fragment normally expected, 
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which contains the Q1X and Micro 2 ID breakpoints and the 
Y75-1B-M1 instability, is arrowed in each lane. This is 
altered in mobility in Q1X, Micro 21D and Y75-1B-M1 as 
shown. The 5.3 and 1.3 kilobase EcoRI fragments flank 
5 the unstable fragment and are present in the Micro 2 ID 
and Q1X hybrids, respectively. 

Cell line Y75-1B-M1 demonstrated an increase in 
size in the common breakpoint fragment from 5 to 5.9 kb. 
It appeared, therefore, that this variation might be 

10 associated with the fragile site, and this hypothesis was 
then tested. 

The \5 subclone containing the 5 kb EcoRI 
fragment was used as a probe on DNA from both normal and 
unrelated Fragile X Syndrome affected males. As depicted 

15 in Figure 4, DNA from four unrelated Fragile X Syndrome 
affected males (lanes 3 to 6) was digested with EcoRI and 
subjected to Southern blot analysis using subclone X-#5 
as probe. Comparison with normal male DNA (lane l) and 
with a normal male from an affected pedigree (lane 2) 

20 revealed the altered mobility of the 5 kb EcoRI fragment 
to one or more high molecular weight bands in each of the 
affected individuals. Accordingly, it has been found 
that unrelated Fragile X Syndrome affected males 
demonstrate instability of DNA sequences at the site 

25 shown in Figure 2 as FRAXA. 

No variation was observed between any normal 
individuals, whereas every Fragile X male showed an 
altered mobility of this sequence. The origin of this 
variability was localized further by using a series of 

30 restriction fragments from the 5 kb EcoRI fragment as 
probes. Fragments A, C and D (Figure 2G) all showed no 
variation between Pst I digests of normal and affected 
individuals (data not shown). The 1.0 kb Pst fragment B 
was found to hybridize to repeat sequences in the human 

35 genome, whereas the 520 base pair fragment E (derived 
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from fragment B) hybridized strongly to a single Pst I 
fragment which again demonstrated variation in size in 
unrelated Fragile X Syndrome affected individuals. Some 
Fragile X Syndrome individuals had from one to six 
5 recognizable bands of varying size and intensity. Others 
had multiple bands which manifested as a smear. In those 
males with only a smear, PCR amplification of the 520 bp 
band from their genomic DNA confirmed that this sequence 
was present and had not been deleted from their genomes 

10 (data not shown) . The number of Fragile X genotype and 
normal DNA samples analyzed and the patterns of 
hybridization seen in them are summarized in Table 2. 
Abnormal bands were seen on Southern Blots EcoRI or PstI 
digests) in 61 Fragile X individuals from 18 families and 

15 48 unrelated controls. 

Table 2 

Single band 2-4 bands Multiple 
of increased of bands of 



20 


size 




increased 
size 


increased 
size 

("smear") 


25 


Males : Affected 


5 


5 


11 




: Transmitting 


3 


1 


1 




Females: Normal 








30 


carriers 


17 


7 


2 




: Affected 


4 


-> 
_/ 


2 




Normal Males (n=26) 


0 


0 


0 


35 


Females (n=22) 


0 


0 


0 
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Males were classifi d as affected by having 
expression of the fragile site and clinical features of 
the Fragile X Syndrome. Transmitting males were 
5 classified by their position in the pedigree or by having 
a high probability, on the basis of flanking DNA 
polymorphisms, of having the Fragile X genotype, and as 
normal by not having either fragile site expression or 
clinical features of the syndrome. Female carriers were 
10 classified as affected or normal on the basis of clinical 
features of the syndrome, regardless of fragile site 
expression. 

The nature of this variable sequence was 
further investigated in Fragile X Syndrome pedigrees, as 
15 depicted in Figure 5. DNA from members of the 

illustrated Fragile X Syndrome pedigree was digested with 
PstI and subjected to Southern blot analysis using 
fragment E as probe. Pedigree symbols : unshaded, normal 
male (square) or female (circle) ; central dot, normal 
20 carrier male (square) or female (circle) not expressing 
Fragile X; half -shaded circle, normal female expressing 
Fragile X; shaded square, retarded fragile X syndrome 
male expressing Fragile X. Normal individuals in 
generation 3 had a less than 2% chance of carrying 
25 Fragile X based upon flanking DNA polymorphisms 

(Sutherland, G.R., and Mulley, J.c, Clinical Genet. 
(1990) 37:2-11). 

This analysis demonstrated segregation of the 
variable sequence with the Fragile X genotype, with 
30 altered mobilities observed in nonpenetrant 

"transmitting" males and carrier females as well as 
affected males. The alteration in mobility varied within 
families where a single band was observed, and in the two 
families studied increased in size from generation to 
35 generation when transmitted by females, but not when 
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transmitted by males, and was larger in affected 
individuals than in normal carriers. The lack of a 
single hybridizing band in some Fragile X genotypes may 
reflect somatic heterogeneity occasionally leading to a 
5 smear, since the probe sequence is known to be present. 
Furthermore, in all cases where a band was observed, the 
variation was manifest as an increase in size, suggesting 
amplification or insertion. These properties suggest 
that the sequences inserted into or amplified from within 

10 the 1 kb Pstl fragment are unstable in Fragile X 

individuals. The molecular basis for the instability is 
not clear because of difficulties in sequence analysis. 
However, the observation of repeat sequences within the 
unstable region suggests that the instability might be 

15 due to variation in the length or number of these 
repeats . 

The restriction map of XTY26 which was derived 
from a Fragile X individual did not appear to differ from 
normal human DNA in the region of instability. This may 
20 be due to an undetected small difference in the size of 

the 1.0 kb Pstl fragment or to deletion of the amplified - 
region during cloning. 

Nature Of the Fragile X-Contai nino FramnmH- 
25 The 1 kb Pst fragment is highly GC rich and in 

Fragile X affected individuals is refractory to PCR 
analysis. A high GC content is reflected in a CpG region 
- which contains recognition sites for several CpG 
containing restriction enzymes. Three of these sites 
30 have been found to be subject to variations in 

methylation status, which segregates with Fragile X 
Syndrome phenotype but not genotype (Vincent, A. et al., 
Jla£ure (1991) 349:624). The finding of sequences at the 
Fragile X locus which exhibit instability (presumably 
35 amplification or insertion) , and which segregate with 
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genotype (regardless of fragile site expression or 
phenotype) , suggests that the degree of size increase in 
these sequences might modulate fragile X expression and 
the associated syndrome. The immediate proximity of the 
5 unstable sequences to a CpG island, denoted p(CCG) n/ 
suggests interference with either the expression of a 
gene or the function of its product, as a molecular basis 
for the disease phenotype. 

The sequence of the 1 kb PstI fragment is shown 
10 in Figure 6. 

Utility of the Fragile X Probe 

The previously mapped markers, Do33 and VK16, 
one distal and one proximal to the Fragile X locus, frame 

15 a 150 kb fragment which contains the fragile site as 
shown in Figure l. Excision of this 150 kb fragment 
provides a more convenient probe than either of the 
closely associated markers. Further restriction and 
mapping of the 150 kb segment results in the preparation 

20 of a probe spanning the fragile site suitable for 
diagnosis. 

The isolated 520 bp segment of the 1 kb Pst 
from the Nhel site of the PstI set fragment forms a 
diagnostic reagent for direct detection of the Fragile X 

25 genotype. It will detect all Fragile X males by the 
altered mobility of a 1 kb PstI band or its apparent 
absence. It will, however, only reliably detect Fragile 
X females where there is a band or bands of altered size 
because, for those females where the abnormal band is a 

30 "smear," the pattern appears to be very similar to that 
of normal females. Testing Fragile X families with this 
probe can be used as a means of Fragile X phenotype 
prediction, as well as genotype identification. 

The fragile site-containing probe is thus used 

35 for diagnosis (e.g., prenatal diagnosis or carrier 
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detection) by standard techn logy utilizing means to 
detect hybridization of the probe under appropriate 
stringency conditions to the abnormal sequence. Any 
suitable means for detection of hybridization can be 
5 used, including radioactive or fluorescent labeling of 
the probe. For effective use as a probe, a fragment of 
the 150 kb segment may be 10 to 10,000 nucleotides in 
length, preferably 50 to 1000 nucleotides in length, more 
preferably 100 1000 nucleotides in length. The probe may 

10 be prepared by enzymatic digestion of a larger fragment 
of DNA or may be synthesized. 

Further, by altering the stringency of the 
conditions of hybridization the sequences corresponding 
to the Fragile X locus can be isolated from normal 

15 subjects, sequenced, and corresponding sequences used in 
genetic therapy to correct this defect. Thus, the 
present invention also provides a method to treat mental 
retardation caused by the presence of a Fragile X locus, 
which method comprises replacing, repairing or 

20 compensating for said fragile site DNA of the X 

chromosome of a subject with the corresponding fragile 
site DNA of a normal chromosome. 

The availability of cloned sequences from the 
Fragile X locus also makes possible the identification of 

25 a protein product encoded by the cloned sequences, such 
proteins may be identified by operably linking the cloned 
sequences to a promoter in an expression vector. Many 
appropriate expression vectors for this purpose are 
widely known in the art. See, for example, Sambrook, et 

30 al w Molecular Cloning : A Labor atory Manual , 1990, Cold 
Spring Harbor Press, Cold Spring Harbor, NY. The protein 
product may be used for diagnostic or therapeutic 
purposes. Thus, for example, the presence, absence, or 
alteration of the protein product may correspond to the 

15 status of an affected individual. Similarly, the protein 
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product from a normal individual may be used to treat an 
affected individual with an altered protein product. 

Furthermore, monoclonal or polyclonal 
antibodies against the protein product may be raised by a 
5 wide variety of techniques widely known in the art. 

These antibodies may be labeled and used in a variety of 
immunoassays , or, as described above, for therapeutic use 
in an affected individual. See, for example, Harlow, et 
al., Antibodies; A Lab oratory Manual , 1988, Cold Spring 
10 Harbor Press, Cold Spring Harbor, NY. 

Isolation of Polymorphic Microsa tellite AC Repeats 
(FRAXAC1 and FRAXAC2) Linked to Fragile X 

The Southern blot hybridization using probes 

described above, while accurate in determining genotype, 

15 is a relatively slow procedure, particularly for prenatal 

diagnosis. Genotype can be determined just as accurately 

by linkage analysis where the fetus is unaffected and 

when informative markers show no recombination with the 

disease locus. When such markers are polymerase chain 

20 

reaction (PCR) -based, then the affection status for 
at-risk pregnancies can be determined much more rapidly 
than with the Southern blot-based test. Therefore, 
characterization of AC repeat sequences in the immediate 
vicinity of the fragile X site p(CCG)„ unstable element 

25 

was undertaken as follows. 

A. Identification of Microsatellite repeat sequence and 
design of PCR Printers 

The 108 \ subclones of the yeast artificial 
chromosome XTY26 were screened in a random-primed 
reaction (Multiprime, Amersham) with synthetic 
poly (AC. GT) (Pharmacia) radioactively labeled with a 32 P- 
dCTP. AC repeat-containing DNA clones were identified by 
hybridization to this probe in 0.5 M sodium phosphate, pH 



WO 92/12262 



PCT/LS92/00020 



-19- 

7.0, 7* SDS (without carrier DNA) at 65°C f r 16 hours 

and by washing at 65 °c for 1 hour in 2x SSC. 

DNA from positive clones was digested with 

either Haelll, Sau3AI, HinFI, Hpall, Rsal, Hinfl or TaqI, 

5 electrophoresed on 1.4% agarose gels, blotted onto nylon 

membranes (GeneScreen Plus, NEN-Dupont) and probed with 
32 

P-poly(AC.GT) as above. Digests which gave a 
hybridizing fragment of less than 600 base pairs were 
chosen for subcloning into M13mpl8 for sequence analysis. 

10 The derived sequences were then used to design synthetic 
oligodeoxyribonucleotide primers suitable for PCR 
analysis of length variation in the AC repeat sequences. 
These sequences for PCR primers were chosen on the basis 
of their apparent uniqueness, their 50% GC composition 

15 and their lack of consecutive G residues which appear to 
interfere with chemical synthesis of 
oligodeoxyribonucleotides . 

The markers from each microsatellite were 
subsequently termed FRAXAC1 (from Xl2) , FRAXAC2 (from 

20 X25) and FRAXAC3 (from \26) . 



B- Heterozygosity of Microsatellit e Regions 

These primers and microsatellite regions were 
used to determine genotypes as follows. PCR incubations 

25 were performed in 10 m! volumes in a Perkin Elmer-Cetus 
thermocycler for 10 cycles at 94 °C for 60s, at 60 °C for 
90s and then 72 °C for 90s, followed by 25 cycles at 94 °C 
for 60s, at 55 °C for 90s, and at 72 °C for 90s. The 
volume was adjusted to 40 /il with formamide loading 

30 buffer (95% formamide, l mM EDTA, 0.01% bromophenyl blue, 
0.01% xylene cyanol) . After denaturation at 90°C for 3 
minutes, 2.5 jil aliquots of each reaction mixture were 
subjected to electrophoresis in 6% polyacrylamide 
denaturing (7 M urea) gels. Genotypes were determined 

35 after autoradiography for 18 hours. Multipoint analysis 
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vas based on genotypes of each marker in the 40 large 
kindred pedigrees of the Centre D ' Etude du Polymorph isme 
Humain (CEPH) and was carried out using the LINKAGE 
(version 4.9) package for use with the CEPH three- 
5 generation families. The observed heterozygosities of 
FRAXAC3 in 18 unrelated females was only 16% and so the 
characterization of this marker was not pursued further. 
The observed heterozygosities of FRAXAC1 and FRAXAC2 were 
found to be 45% and 80%, respectively, in 40 unrelated 
10 females. However, none of the females homozygous for 

FRAXAC2 were heterozygous for FRAXAC1 and so the combined 
observed heterozygosity was also 80% (Table 3) . This 
indicates linkage disequilibrium between the two markers. 



20 
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Table 3 



Marker 



Allele 
(AC) n 



Allele % Heterozygosity 

Frequency Observed Expected* 



FRAXAC1 



10 



FRAXAC2 



15 



20 



19 


0.0625 


18 


0.0125 


17 


0.725 


16 


0.1875 


15 


0.0125 


23 


0.009 


19 


0.018 


18 


0.073 


17 


0.477 


16 


0.193 


15 


0.037 


14 


0.110 


13 


0.083 



45 



43.5 



80 



71 



♦Based on observed allele frequencies. 

25 

c - Genotvpina of frav a.ci and fra?^ ? 

The genotypes of both markers were determined 

in the 40 unrelated families from CEPH,. No recombination 

was observed between them. 
30 Fragile X-affected pedigrees who had 

previously been shown to have recombinants in the 

vicinity of the fragile site were genotyped with FRAXAC2 . 

Of those individuals who were informative, no 

recombination was found between this marker and the 
3 5 Fragile X genotype (as determined by hybridization with a 
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subclone of the PstI fragment described below) or Fragile 
X phenotype (as determined by Fragile X expression and/ or 
intellectual handicap) . Thus, these markers are 
considerably more closely linked to Xq27.3 than the 
5 previously mapped AC repeat sequences VK23AC and VK144AC. 
Analysis with FRAXAC1 was not undertaken because of the 
high degree of linkage disequilibrium between the two 
markers. The subclone used as probe denoted pfxa3 is the 
Nhel to PstI subclone of the PstI band as shown in Figure 
10 8. 



D. Alternati ve Method for Fragile X Diagnosis 

Thus, an alternative approach to rapid 
diagnostic analysis of Fragile X Syndrome would be to use 

15 these tightly linked, highly informative genetic markers. 
Together with the pfxa3 hybridization probe, these new 
FRAXA markers provide a rational approach to prenatal 
diagnosis in Fragile X pedigrees. This involves analysis 
of chorionic villi sample DNA (CVS) with the AC repeat 

20 markers FRAXA1 or FRAXA2.3 to haplotype the FRAXA locus, 
followed by the Southern blots with the pfxa3 as probe to 
detect amplification of the p(CCG) n repeat. The initial 
microsatellite results allow rapid determination of 
unaffected status in 40% of cases whereas the prediction 

25 of phenotype for individuals with the FRAXA genotype will 
be subsequently determined by the size of pfxa3 
hybridizing fragments. 



30 



35 
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CLAIMS 

1. A purified and isolated DNA molecule of 
not more than 275 kb which comprises the human Fragile X 

5 locus . 

2. The DNA molecule of claim l which is 

XTY26. 

10 3. A purified and isolated DNA molecule of 

not more than 150 kb which comprises the human Fragile X 
locus . 

4. The DNA of claim 3 which comprises the 
15 segment of XTY26 located between VK16 and Do33. 

5. A DNA probe suitable for diagnosis of 
Fragile X Syndrome which comprises an oligomeric DNA 
comprising the human Fragile X locus. 



20 



25 



6. The DNA probe of claim 5 which comprises 
a 1 kb Pst fragment of XTY26. 

7. The DNA probe of claim 6 which is pfxa. 



8. The DNA probe of claim 6 which comprises 
the sequence or an effective fragment thereof: 

10 20 30 40 50 

CTGCAGAAAT GGGCGTTCTG GCCCTCGCGA GGCAGTGCGA CCTGTCACCG 
30 60 70 80 90 100 

CCCTTCAGCC TTCCCGCCCT CCACCAAGCC CGCGCACGCC CGGCCCGCGC 
H° 120 130 140 150 

GTCTGTCTTT CGACCCGGCA CCCCGGCCGG TTCCCAGCTG CGCGCATGCC 
160 170 180 190 200 

35 GGCGCTCCCA GGCCACTTGA AGAGAGAGGG CGGGGCCGAG GGGCTGAGCC 

FTTS00B1.WP .15 
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210 220 230 240 250 

GCGGGGGGAG GGAACAGCGT TGATCACGTG ACGTGGTTTC AGTGTTTACA 
260 270 280 290 300 

CCCGCAGCGG GCCCGGGGGT TCGGCCTCAG TCAGGCGCTC AGCTCCGTTT 
5 310 320 330 340 350 

CGTTTCACTT CCGGTGGAGG GCCGCCTCTG AGCGGGCGGC GGGCCGACGG 
360 370 380 390 400 

CGAGCGCGGG CGGCGGCGGC GGCGGCGGCG GCGGCGGCGG CGGCGGCGGC 
410 420 430 440 450 

10 GGCGGCGGTG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCGGCGG 
460 470 480 490 500 

CGGCGGCGGC GGCGGCGGCG GCGGCCCGGA GCCACCTCTT CGGGGGCGGG 
510 520 530 540 550 

CTCCCGGCGC TAGCAGGGCT GAAGAGAAGA TGGAGGAGCT GGTGGTGGAA 
15 560 570 580 580 600 

CTGCGGGGCT CCAATGGCGC TTTCTACAAG GTACTTGGCT CTAGGGCAGG 
610 620 630 640 650 

CCCCATCTTC GCCCTTCCTT CCCTCCCTTT TCTTCTTGGT GTCGGCGGGA 
660 670 680 690 700 

20 GGCAGGCCCG GGGCCCTCTT CCCGAGCACC GCGCCTGGGT GCCAGGGCAC 
710 720 730 740 750 

GCTCGGCGGG ATGTTGTTGG AGGGAAGGAC TGGACTTGGG GCCTGTTGGA 
760 770 780 790 800 

AGCCCCTCTC CGACTCCGAG AGGCCCTAGC GCCTATCGAA ATGAGAGACC 
25 810 820 830 840 850 

AGCGAGGAGA GGGTTCTCTT TCGGCGCCGA GCCCGCCGGG GTGAGCTGGG 
860 870 880 890 900 

GATGGGCGAG GGCCGGCGGC AGGTACTAGA GCCGGGCGGG AAGGGCCGAA 
910 920 930 940 950 

30 ATCGGCGCTA AGTGACGGCG ATGGCTTATT CCCCCTTTCC TAAACATCAT 
960 970 980 990 1000 

CTCCCAGCGG GATCCGGGCC TGTCGTGTGG GTAGTTGTGG AGGAGCGGGG 
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1010 1020 1030 

GGCGCTTCAG CCGGGCCACC TCCTGCAG . 



9. The probe of claim 6 conjugated to label, 

10. The probe of claim 9 wherein the label 
is fluorescent or radioactive. 



11. A purified and isolated DNA comprising 
the sequence: 



10 


20 


30 


40 


50 


CTGCAGAAAT 


GGGCGTTCTG 


GCCCTCGCGA 


GGCAGTGCGA 


CCTGTCACCG 


60 


70 


80 


90 


100 


CCCTTCAGCC 


TTCCCGCCCT 


CCACCAAGCC 


CGCGCACGCC 


CGGCCCGCGC 


110 


120 


130 


140 


150 


GTCTGTCTTT 


CGACCCGGCA 


CCCCGGCCGG 


TTCCCAGCTG 


CGCGCATGCC 


160 


170 


180 


190 


200 


GGCGCTCCCA 


GGCCACTTGA 


AGAGAGAGGG 


CGGGGCCGAG 


GGG CTGAGCC 


210 


220 


230 


240 


250 


GCGGGGGGAG 


GGAACAGCGT 


TGATCACGTG 


ACGTGGTTTC 


AGTGTTTACA 


260 


270 


280 


290 


300 


CCCGCAGCGG 


GCCCGGGGGT 


TCGGCCTCAG 


TCAGGCGCTC 


AGCTCCGTTT 


310 


320 


330 


340 


350 


CGTTTCACTT 


CCGGTGGAGG 


GCCGCCTCTG 


AGCGGGCGGC 


GGGCCGACGG 


360 


370 


380 


390 


400 


CGAGCGCGGG 


CGGCGGCGGC 


GGCGGCGGCG 


GCGGCGGCGG 


CGGCGGCGGC 


410 


420 


430 


, 440 


450 


GGCGGCGGTG 


GCGGCGGCGG 


CGGCGGCGGC 


GGCGGCGGCG 


GCGGCGGCGG 


460 


470 


480 


490 


500 


CGGCGGCGGC 


GGCGGCGGCG 


GCGGCCCGGA 


GCCACCTCTT 


CGGGGGCGGG 


510 


520 


530 


540 


550 


CTCCCGGCGC 


TAGCAGGGCT 


GAAGAGAAGA 


TGGAGGAGCT 


GGTGGTGGAA 


560 


570 


580 


580 


600 


CTGCGGGGCT 


CCAATGGCGC 


TTTCTACAAG 


GTACTTGGCT 


CTAGGGCAGG 
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610 


620 


630 


640 


650 


CCCCATCTTC 


GCCCTTCCTT 


CCCTCCCTTT 


TCTTCTTGGT 


GTCGGCGGGA 


660 


670 


680 


690 


700 


GG CAGGCCCG 


GGGCCCTCTT 


CCCGAGCACC 


GCGCCTGGGT 


GCCAGGGCAC 


710 


720 


730 


740 


750 


GCTCGGCGGG 


ATGTTGTTGG 


AGGGAAGGAC 


TGGACTTGGG 


GCCTGTTGGA 


760 


770 


780 


790 


800 


AGC CCCTCTC 

Aw w www A W A W 


CGACTCCGAG 

WwVAW A WwwXl^7 


£lwwww W A Aw W 


GCCTATCGAA 

www A^l A wwZUl 


ATGAGAGACC 

A A VJAVJAUA W W 


fii n 


85ft 

O m* w 


O JU 




O «J V/ 


Af?PCAf?CACA 

Aw ^» wAww A wA 


OUUl X w A wX X 


X www wwwwwA 


(zcccczccczezcz 

WWW WW WW www 


w X wAw w X www 


O w V 


870 


880 

O O \J 


oon 

O 7 Is 


900 

J w V 


GATGGGCGAG 


GGCCGGCGGC 


AGGTACTAGA 


GCCGGGCGGG 


AAGGGC CG AA 


910 


920 


930 


940 


950 


ATCGGCGCTA 


AGTGACGGCG 


ATGGCTTATT 


CCCCCTTTCC 


TAAACATCAT 


960 


970 


980 


990 


1000 


CTCCCAGCGG 


GATCCGGGCC 


TGTCGTGTGG 


GTAGTTGTGG 


AGGAGCGGGG 


1010 


1020 


1030 






GGCGCTTCAG 


CCGGGCCACC 


TCCTGCAG. 







12. A purified and isolated DNA molecule 
of less than 50 nucleotides flanking a polymorphic 
microsatellite repeat sequence within the DNA molecule of 
claim 1, wherein said DNA molecule is a primer for 
amplification of said microsatellite repeat sequence. 

13 ♦ The DNA molecule of claim 12, wherein 
said DNA molecule comprises the nucleotide sequence 
GATCTAATCA ACATCTATAG ACTTTATT. 

3 0 14. The DNA molecule of claim 12, wherein 

said DNA molecule comprises the nucleotide sequence 
AGGCTTGGA GTGCAGTGGG CAATCT- 
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15. The DNA molecule of claim 12, wherein 
said microsatellite repeat s quence comprises the 
nucleotide sequence 

(GT) n CAGTCTCA CTCTGTCACTC , 
5 wherein n is 1-100. 

16. The DNA molecule of claim 12, wherein 
said DNA molecule comprises the nucleotide sequence 

GACTGCTCCGG AAGTTGAATC CTCA. 



10 



17. The DNA molecule of claim 12 f wherein 

said DNA molecule comprises the nucleotide sequence 
AGACAGGATC TCACTCTGTC ACCTAG. 



15 18. The DNA molecule of claim 12, wherein 

said microsatellite repeat sequence comprises the 
nucleotide sequence 

GTATTT TTGCAAAGTT TGTCTTTCAG 
TATTTTATTT(GT) n ATATATATAT ATTTTTTTTT TTTTTTTTAA, 

20 wherein n is 1-100. 

19. The DNA molecule of claim 12, wherein 
said DNA molecule comprises the nucleotide sequence 

GTACTGTATC AGTTATAACC CTATG. 

25 

20. The DNA molecule of claim 12, wherein 
said DNA molecule comprises the nucleotide sequence 

CAAAT TGAAGGTTTG TGGAAACCT 



30 21 • The DNA molecule of claim 12, wherein 

said microsatellite repeat sequence comprises the 
nucleotide sequence 

TGTGT GTGTGC(GT) n ATGCAT ACCCAAGACT TATCTTATAC 
AGGTATGCCT TGTTTTATTG CACTTTGCAA ATACTGCATT TTTTT, 

3 5 wherein n is 1-100. 
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22. A purified protein product encoded by 
the DNA of claim 1. 

5 23. A monoclonal antibody immunologically 

reactive with the protein of claim 22. 

24. A polyclonal antibody immunologically 
reactive with the protein of claim 22. 

10 

25. A method to diagnose the presence or 
absence of a Fragile X allele in a subject which method 
comprises digesting the DNA of said subject to obtain DNA 
fragments, separating the fragments according to size, 

15 and detecting a fragment spanning a DNA sequence 
comprising said allele among the fragments by 
hybridization to the probe of claim 6. 

26. A method to treat mental retardation 
20 caused by the presence of a Fragile X allele, which 

method comprises replacing, repairing or compensating for 
DNA spanning a DNA sequence comprising said allele of a 
subject with the corresponding DNA sequence of a normal 
chromosome. 

25 

27. A method to treat mental retardation 
caused by the presence of a Fragile X allele, which 
method comprises replacing or compensating for protein 
encoded by the DNA sequence comprising said allele of a 

30 subject with the corresponding protein of a normal 
chromosome. 

28. A method for screening for a Fragile 
X allele in a subject, comprising 

35 



WO 92/12262 



PCT/US92/OOO20 



-29- 

a) designating a microsatellite repeat allele 
closely linked to Xq27.3 in a family, wherein said 
subject is a member of the family; and 

b) screening said subject for presence of the 
5 microsatellite repeat allele closely linked to Xq27.3. 

29. The method of claim 28, wherein 
step (b) further comprises amplification of said 
microsatellite repeat sequence with the primer of 
10 claim 12. 



30. The method of claim 28, further 
comprising detecting amplification of a p(CCG) n repeat at 
said Xq27.3. 

15 

31. The method of claim 28, wherein 
detection of said repeat further comprises hybridization 
with the probe of claim 7. 

20 32. The method of claim 28, wherein said 

microsatellite repeat allele is closely linked to Xq27.3 
in a population. 
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AMEMJED CLAIMS 

[received by the International Bureau on 24 June 1992 (24.06.92); 
5 original claims 8,11,13-21 amended; other claims unchanged (7 pages)] 

- 1. A purified and isolated DNA molecule of not 
more than 275 kb which comprises the human Fragile X locus, 

10 2. The DNA molecule of claim 1 which is XTY26 . 

3. A purified and isolated DNA molecule of not 
more than 150 kb which comprises the human Fragile X locus. 

15 4. The DNA of claim 3 which comprises the segment 

of XTY26 located between VK1 6 and Do33. 

5. A DNA probe suitable for diagnosis of Fragile 
X Syndrome which comprises an oligomeric DNA comprising the 

20 human Fragile X locus. 

6. The DNA probe of claim 5 which comprises a 1 
kb Pst fragment of XTY26. 

25 7. The DNA probe of claim 6 which is pfxa. 



8. The DNA probe of claim 6 which comprises the 
sequence (SEQ ID NO 1) or an effective fragment thereof: 



10 


20 


30 


40 


50 


CTGCAGAAAT 


GGGCGTTCTG 


GCCCTCGCGA 


GGCAGTGCGA 


CCTGTCACCG 


60 


70 


80 


90 


100 


CCCTTCAGCC 


TTCCCGCCCT 


CCACCAAGCC 


CGCGCACGCC 


CGGCCCGCGC 


110 


120 


130 


140 


150 


GTCTGTCTTT 


CGACCCGGCA 


CCCCGGCCGG 


TTTCCCAGCT 


CGCGCATGCC 


160 


170 


180 


190 


200 


GGCGCTCCCA 


GGCCACTTGA 


AGAGAGAGGG 


CGGGGCCGAG 


GGGCTGAGCC 


210 


220 


230 


240 


250 


GCGGGGGGAG 


GGAACAGCGT 


TGATCACGTG 


ACGTGGTTTC 


AGTGTTTACA 


260 


270 


280 


290 


300 


CCCGCAGCGG 


GCCCGGGGGT 


TCGGCCTCAG 


TCAGGCGCTC 


AGCTCCGTTT 


310 


320 


330 


340 


350 
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910 


920 
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950 


ATC66CGCTA 


AGTGACGGCG 


ATGGCTTATT 


CCCCCTTTCC 


TAAACATCAT 


960 


970 


980 


990 


1000 


CTCCCAGCGG 


GATCCGGGCC 


TGTCGTGTGG 


GTAGTTGTGG 


AGGAGCGGGG 



WO 92/12262 



PCT/US92/0O020 



32 

1010 1020 1030 

GGCGCTTCAG CCGGGCCACC TCCTGCAG. 



9. The probe of claim 6 conjugated to label. 

10. The probe of claim 9 wherein the label is 
fluorescent or radioactive. 



11. A purified and isolated DNA comprising the 
10 sequence (SEQ ID NO 1) : 



10 


20 


30 


40 


50 


CTGCAGAAAT 


GGGCGTTCTG 


GCCCTCGCGA 


GGCAGTGCGA 


CCTGTCACCG 


60 


70 


80 


90 


100 


CCCTTCAGCC 


TTCCCCGCCC 


CCACCAAGCC 


CGCGCACGCC 


CGGCCCGCGC 


110 


120 


130 


140 150 


GTCTGTCTTT 


CGACCCGGCA 


CCCCGGCCGG 


TTCCCAGCTG 


CGCGCATGCC 


160 


170 


180 


190 


200 


GGCGCTCCCA 


GGCCACTTGA 


AGAGAGAGGG 


CGGGGCCGAG 


GGGCTGAGCC 


210 


220 


230 


240 


250 


GCGGGGGGAG 


GGAAGAGCGT 


TGATCACGTG 


ACGTGGTTTC 


AGTGTTTACA 


260 


270 


280 


290 


300 


CCCGCAGCGG 


GCCCGGGGGT 


TCGGCCTCAG 


TCAGGCGCTC 


AGCTCCGTTT 


310 


320 


330 


340 


350 


CGTTTCACTT 


CCGGTGGAGG 


GCCGCCTCTG 


AGCGGGCGGC 


GGGCCGACGG 


360 


370 


380 


390 


400 


CGAGCGCGGG 


CGGCGGCGGC 


GGCGGCGGCG 


GCGGCGGCGG 


CGGCGGCGGC 


410 


420 


430 


440 


450 


GGCGGCGGTG 


GCGGCGGCGG 


CGGCGGCGGC 


GGCGGCGGCG 


GCGGCGGCGG 


460 


470 


480 


490 


500 


CGGCGGCGGC 


GGCGGCGGCG 


GCGGCCCGGA 


GCCACCTCTT 


CGGGGGCGGG 


510 


520 


530 


540 


550 


CTCCCGGCGC 


TAGCAGGGCT 


GAAGAGAAGA 


TGGAGGAGCT 


GGTGGTGGAA 


560 


570 


580 


590 


600 


CTGCGGGGCT 


CCAATGGCGC 


TTTCTACAAG 


GTACTTGGCT 


CTAGGGCAGG 
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990 
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CTCCCAGCGG 


GATCCGGGCC 


TGTCGTGTGG 


GTAGTTGTGG 


AGGAGCGGGG 


1010 


1020 


1030 






GGCGCTTCAG 


CCGGGCCACC 


TCCTGCAG . 







20 12. A purified and isolated DNA molecule of less 

than 50 nucleotides flanking a polymorphic microsatellite 
repeat sequence within the DNA molecule of claim 1, * 
wherein said DNA molecule is a primer for amplification 
of said microsatellite repeat sequence. 

25 

13. The DNA molecule of claim 12, wherein said DNA 
molecule comprises the nucleotide sequence {SEQ ID NO 2) 
GATCTAATCA ACATCTATAG ACTTTATT . 



30 



14. The DNA molecule of claim 12, wherein said DNA 
molecule comprises the nucleotide sequence (SEQ ID NO 3) 
AGGCTTGGA GTGCAGTGGG CAATCT . 
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15. The DNA molecule of claim 12 f wherein said 
microsatellite repeat sequence comprises the nucleotide 
sequence (SEQ ID NO 4) 

( GT) n CAGTCTCA CTCTGTCACTC , 
5 wherein n is 1-100. 

16. The DNA molecule of claim 12 , wherein said 
DNA molecule comprises the nucleotide sequence (SEQ ID NO 
5) 

10 GACTGCTCCGG AAGTTGAATC CTCA. 

17. The DNA molecule of claim 12, wherein said 
DNA molecule comprises the nucleotide sequence (SEQ ID NO 
6) 

15 AGACAGGATC TCACTCTGTC ACCTAG. 

18. The DNA molecule of claim 12 , wherein said 
microsatellite repeat sequence comprises the nucleotide 
sequence (SEQ ID NO 7) 

20 GTATTT TTGCAAAGTT TGTCTTTCAG 

TATTTTATTT (GT) n ATATATATAT ATTTTTTTTT TTTTTTTTAA, 
wherein n is 1-100. 

19. The DNA molecule of claim 12, wherein said 
25 DNA molecule comprises the nucleotide sequence (SEQ ID NO 

8) 

GTACTGTATC AGTTATAACC CTATG. 

20. The DNA molecule of claim 12, wherein said 
30 DNA molecule comprises the nucleotide sequence (SEQ ID NO 

9) 

CAAAT TGAAGGTTTG TGGAAACCT. 

21. The DNA molecule of claim 12, wherein said 
35 microsatellite repeat sequence comprises the nucleotide 

sequence (SEQ ID NO 10) 

TGTGT GTGTGC(GT) tt ATGCAT ACCCAAGACT TATCTTATAC 
AGGTATGCCT TGTTTTATTG CACTTTGCAA ATACTGCATT TTTTT, 
wherein n is 1-100, 



WO 92/12262 



PCT/US92/0002O 



35 

22. A purified protein product encoded by the 
DNA of claim 1. 

23. A monoclonal antibody immunologically 
5 reactive with the protein of claim 22. 

24. A polyclonal antibody immunologically 
reactive with the protein of claim 22. 

25. A method to diagnose the presence or 
absence of a Fragile X allele in a subject which method 
comprises digesting the DNA of said subject to obtain DNA 
fragments , separating the fragments according to size, 
and detecting a fragment spanning a DNA sequence 
comprising said allele among the fragments by 
hybridization to the probe of claim 6. 

26. A method to treat mental retardation 
caused by the presence of a Fragile X allele, which 

20 method comprises replacing, repairing or compensating for 
DNA spanning a DNA sequence comprising said allele of a 
subject with the corresponding DNA sequence of a normal 
chromosome. 

27. A method to treat mental retardation 
caused by the presence of a Fragile X allele, which 
method comprises replacing or compensating for protein 
encoded by the DNA sequence comprising said allele of a 
subject with the corresponding protein of a normal 
chromosome . 

28. A method for screening for a Fragile X 
allele in a subject, comprising 



10 



15 



25 



30 
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a) designating a microsatellite repeat 
allele closely linked to Xq27.3 in a family, wherein said 
subject is a member of the family; and 

b) screening said subject for presence of 
5 the microsatellite repeat allele closely linked to 

Xq27.3. 

29. The method of claim 28, wherein step (b) 
further comprises amplification of said microsatellite 

10 repeat sequence with the primer of claim 12. 

30. The method of claim 28, further comprising 
detecting amplification of a p(CC6) B repeat at said 
Xq27.3. 

15 

31. The method of claim 28, wherein detection 
of said repeat further comprises hybridization with the 
probe of claim 7. 

20 32. The method of claim 28, wherein said 

microsatellite repeat allele is closely linked to Xq27.3 
in a population. 
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Figure 6 

j'3 :a 4« sg ga 
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78 65 59 !00 :i0 |-g 
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5 -3 i4 « isa i68 iT3 iae 
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'88 :oe ;ie sre ;io 2*0 

C636SCC3AS 636C73A6CC ■•C5 C55 5Sf . & <?€AnCAGC57 75A7CACS76 ACS7S5777C 

:i3 :ss :?a :se :« :bq 

ASTST-r^CA CCC3CA6C33 5CC:S5Se57 7C35CCTCAS 7C.-.S5C3C7C ASCTCC5TTT 

3!« "3 233 348 358 J 3B8 

CSTTTCACTT CC6S7S6A65 SCCGCCTCTS A6CS65C56C &6&C;5AC66 C5A6C5C666 

370 3S2 333 488 Ai3 XZZ 

CG5C65CSSC 5SC65C33C5 5C55C35CS3 C55CS5CS5C 53C55CS375 3CS6C6SC66 ° 

438 «A8 458 488 4?8 46ft- 

CSSC35CSSC 68C66C68C5 3CS3C66C63 CS8C68CSSC 63C66C86C6 6C66CCC68K 

498 / S80 StQ ,528 838 848 

GCCACCTC7- <C66i36C666 CTCCSS6C6C TASCA666CT 5AA6A6AAS A T66AG6A6C7 

558 SIO S78 588 S88 IM 

66T65736AA 676C886SC7 CSAATS5C3C 77TC7ACAAG 3TACTT66CT C7A6WCA86 

6,9 5:3 839 648 658 ISA 

CCCCA7C77C 6CCC77C:77 C:CTCCC777 7C7TCTT667 67C6SCS68A 65CA8SCCCS 

878 56a 698 786 718 T2 

»»6CCC7C77 CCC6A6CACC 5CSCCT656T 5CCA666CAC 6C7CS6C86S A75rr6TT63 

738 — 7 * 8 7Sa ™ 77B 7|| 
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790 800 618 SIB 836 64« 

SCC7A7CBAA A76A8A6ACC ASCSASSA6A 63677C7C77 7CS6C6CC8A 6CCC6CC663 

3S6 868 878 66Q f96 9M 

676A6C7S53 SATSS6CSA6 63CCSSC66C A667AC7A6A SCC6SSC665 AA666CCSAA 

3,9 9:0 938 648 SS8 9Sfl 

A7C6EC3C7A A6TBACB8C6 A75SC77A77 CCCCC777CC 7AAACA7CA7 C7CCCASCBS 
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6ATCCS38C: 76TCS76756 67A6776766 A66A3C6668 S6C6CTTCA8 CC666CCACC 

««*8 I0S8 t Me 1878 1611 

7CCT6CA6 ' ' ,,Bi 
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Figure 7 

. GATCTAATCA ACATCTATAG ACTTTATT CT GTGTGTGTGT GTGTGTCTGT GTATCTCTGT 

GTCAGTCTCA CTCTGTCACT CAGGCTTGCA GTGCACTCCC CAATCT CTC.T TCACTCC;./.: 

100 

CTCGCCTCCC AGCTTCAAGT CACTCTCATC ATGCCTCAGC CTCCTGAGTA GCTGGGAT7A 

CAGGCATGCA CCACCACACC CAGCTAATTT TTTGCA7TTT TAGTAGAGTC GCCATTTCAC 

200 

TATGTTGGCC AGGCTGCTCT CGAACTTCTG GCCTCAAGTG ATC 

GGCCCTAATC AGATTTCCAC AAATTCTGAC TTAATATTTG CCCGCTTATA TAACAGCTCT 
TCTTTAACAA A/.ACAAGTAC TTTTCTCAAT AGAATTTTAC TAAGAAACCT CTTTAGTAAA 
ACATCGACAT TATACATACA ACATATCTCA GTATCTGCTG ATGAAGAACA CCAAAAAGAA 
CCCAGATG TG ACTGCTCCGG AAGTTGAATC CTCA GTATTT TTGCAAAGTT TGTCTTTCAG 

TATTTTATTT GTGTGTGTGT GTGTGTGTGT GTGTCTCTCT ATATATATAT ATT ! If HI T 

100 

HTTI11IAA ACACAGGATC TCACTCTCTC ACCTACG CTR CACTCiCAGTG CATGATCATG 

GTACTGTATC AGTTATAACC CTATGTfiTftT GTCTCCGTGT CTGTCTGTCT GTCTATGCAT 

ACCCAAGACT TATCTTATAC ACGTATGCCT TGTTTTATTC CACTTTGCAA ATACTGCATT 

100 

TTTTTCAAAT TGAAGGTTTG TGGAAACCT T TTTTTTGAGC AATTCTGTAG TGCCATTTTT 
TTCAACGGCA TGTGTAC 
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